Dataset statistics
| Number of variables | 18 |
|---|---|
| Number of observations | 6108 |
| Missing cells | 7195 |
| Missing cells (%) | 6.5% |
| Duplicate rows | 6 |
| Duplicate rows (%) | 0.1% |
| Total size in memory | 859.1 KiB |
| Average record size in memory | 144.0 B |
Variable types
| NUM | 14 |
|---|---|
| CAT | 4 |
Reproduction
| Analysis started | 2020-07-26 10:17:40.319376 |
|---|---|
| Analysis finished | 2020-07-26 10:18:32.780068 |
| Duration | 52.46 seconds |
| Version | pandas-profiling v2.8.0 |
| Command line | pandas_profiling --config_file config.yaml [YOUR_FILE.csv] |
| Download configuration | config.yaml |
Country_Region has constant value "US" | Constant |
| Dataset has 6 (0.1%) duplicate rows | Duplicates |
Province_State has a high cardinality: 59 distinct values | High cardinality |
Last_Update has a high cardinality: 107 distinct values | High cardinality |
Active is highly correlated with Confirmed | High correlation |
Confirmed is highly correlated with Active and 1 other fields | High correlation |
People_Hospitalized is highly correlated with Confirmed and 1 other fields | High correlation |
Deaths is highly correlated with People_Hospitalized | High correlation |
ISO3 is highly correlated with Province_State | High correlation |
Province_State is highly correlated with ISO3 | High correlation |
Lat has 228 (3.7%) missing values | Missing |
Long_ has 228 (3.7%) missing values | Missing |
Recovered has 1477 (24.2%) missing values | Missing |
Incident_Rate has 228 (3.7%) missing values | Missing |
People_Tested has 228 (3.7%) missing values | Missing |
People_Hospitalized has 2200 (36.0%) missing values | Missing |
Mortality_Rate has 123 (2.0%) missing values | Missing |
Testing_Rate has 228 (3.7%) missing values | Missing |
Hospitalization_Rate has 2200 (36.0%) missing values | Missing |
Confirmed has 123 (2.0%) zeros | Zeros |
Deaths has 240 (3.9%) zeros | Zeros |
Recovered has 130 (2.1%) zeros | Zeros |
Active has 94 (1.5%) zeros | Zeros |
Incident_Rate has 105 (1.7%) zeros | Zeros |
Mortality_Rate has 117 (1.9%) zeros | Zeros |
| Distinct count | 59 |
|---|---|
| Unique (%) | 1.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 47.7 KiB |
| Maryland | 105 |
|---|---|
| Washington | 105 |
| Arizona | 105 |
| Kansas | 105 |
| Kentucky | 105 |
| Other values (54) |
| Value | Count | Frequency (%) | |
| Maryland | 105 | 1.7% | |
| Washington | 105 | 1.7% | |
| Arizona | 105 | 1.7% | |
| Kansas | 105 | 1.7% | |
| Kentucky | 105 | 1.7% | |
| South Dakota | 105 | 1.7% | |
| Mississippi | 105 | 1.7% | |
| Hawaii | 105 | 1.7% | |
| Nevada | 105 | 1.7% | |
| Florida | 105 | 1.7% | |
| Other values (49) | 5058 | 82.8% |
Length
| Max length | 24 |
|---|---|
| Median length | 8 |
| Mean length | 9.292239686 |
| Min length | 4 |
| Distinct count | 1 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 47.7 KiB |
| US |
|---|
| Value | Count | Frequency (%) | |
| US | 6108 | 100.0% |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
| Distinct count | 107 |
|---|---|
| Unique (%) | 1.8% |
| Missing | 19 |
| Missing (%) | 0.3% |
| Memory size | 47.7 KiB |
| 2020-04-30 02:32:31 | 59 |
|---|---|
| 2020-04-25 06:32:46 | 59 |
| 2020-04-29 02:32:33 | 59 |
| 2020-04-27 02:32:46 | 59 |
| 2020-04-21 23:40:34 | 59 |
| Other values (102) |
| Value | Count | Frequency (%) | |
| 2020-04-30 02:32:31 | 59 | 1.0% | |
| 2020-04-25 06:32:46 | 59 | 1.0% | |
| 2020-04-29 02:32:33 | 59 | 1.0% | |
| 2020-04-27 02:32:46 | 59 | 1.0% | |
| 2020-04-21 23:40:34 | 59 | 1.0% | |
| 2020-04-26 02:32:45 | 59 | 1.0% | |
| 2020-04-22 23:40:26 | 59 | 1.0% | |
| 2020-04-28 02:32:46 | 59 | 1.0% | |
| 2020-04-24 03:33:00 | 59 | 1.0% | |
| 2020-07-26 04:35:13 | 58 | 0.9% | |
| Other values (97) | 5500 | 90.0% |
Length
| Max length | 19 |
|---|---|
| Median length | 19 |
| Mean length | 18.95022921 |
| Min length | 3 |
| Distinct count | 58 |
|---|---|
| Unique (%) | 1.0% |
| Missing | 228 |
| Missing (%) | 3.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 36.840089285714285 |
|---|---|
| Minimum | -14.271 |
| Maximum | 61.3707 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 47.7 KiB |
Quantile statistics
| Minimum | -14.271 |
|---|---|
| 5-th percentile | 15.0979 |
| Q1 | 34.5946 |
| median | 39.06185 |
| Q3 | 42.36165 |
| 95-th percentile | 47.4009 |
| Maximum | 61.3707 |
| Range | 75.6417 |
| Interquartile range (IQR) | 7.76705 |
Descriptive statistics
| Standard deviation | 10.79030945 |
|---|---|
| Coefficient of variation (CV) | 0.2928958551 |
| Kurtosis | 7.667000536 |
| Mean | 36.84008929 |
| Median Absolute Deviation (MAD) | 4.15675 |
| Skewness | -2.153915943 |
| Sum | 216619.725 |
| Variance | 116.4307781 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 31.0545 | 105 | 1.7% | |
| 43.3266 | 105 | 1.7% | |
| 35.5653 | 105 | 1.7% | |
| 36.1162 | 105 | 1.7% | |
| 39.8494 | 105 | 1.7% | |
| 38.3135 | 105 | 1.7% | |
| 33.0406 | 105 | 1.7% | |
| 37.7693 | 105 | 1.7% | |
| 38.5266 | 105 | 1.7% | |
| 37.6681 | 105 | 1.7% | |
| Other values (48) | 4830 | 79.1% | |
| (Missing) | 228 | 3.7% |
| Value | Count | Frequency (%) | |
| -14.271 | 101 | 1.7% | |
| -14.271 | 4 | 0.1% | |
| 13.4443 | 105 | 1.7% | |
| 15.0979 | 105 | 1.7% | |
| 18.2208 | 105 | 1.7% |
| Value | Count | Frequency (%) | |
| 61.3707 | 105 | 1.7% | |
| 47.5289 | 105 | 1.7% | |
| 47.4009 | 105 | 1.7% | |
| 46.9219 | 105 | 1.7% | |
| 45.6945 | 105 | 1.7% |
| Distinct count | 58 |
|---|---|
| Unique (%) | 1.0% |
| Missing | 228 |
| Missing (%) | 3.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -85.20661431972789 |
|---|---|
| Minimum | -170.1322 |
| Maximum | 145.6739 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 47.7 KiB |
Quantile statistics
| Minimum | -170.1322 |
|---|---|
| 5-th percentile | -152.4044 |
| Q1 | -101.165775 |
| median | -87.9442 |
| Q3 | -76.970625 |
| 95-th percentile | -64.8963 |
| Maximum | 145.6739 |
| Range | 315.8061 |
| Interquartile range (IQR) | 24.19515 |
Descriptive statistics
| Standard deviation | 49.3124053 |
|---|---|
| Coefficient of variation (CV) | -0.5787391706 |
| Kurtosis | 14.38077386 |
| Mean | -85.20661432 |
| Median Absolute Deviation (MAD) | 11.6672 |
| Skewness | 3.415406708 |
| Sum | -501014.8922 |
| Variance | 2431.713316 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| -119.6816 | 105 | 1.7% | |
| -91.8678 | 105 | 1.7% | |
| -82.7649 | 105 | 1.7% | |
| -157.4983 | 105 | 1.7% | |
| -81.6868 | 105 | 1.7% | |
| -114.4788 | 105 | 1.7% | |
| -75.5071 | 105 | 1.7% | |
| -98.2681 | 105 | 1.7% | |
| -88.9861 | 105 | 1.7% | |
| -77.0268 | 105 | 1.7% | |
| Other values (48) | 4830 | 79.1% | |
| (Missing) | 228 | 3.7% |
| Value | Count | Frequency (%) | |
| -170.1322 | 1 | < 0.1% | |
| -170.132 | 104 | 1.7% | |
| -157.4983 | 105 | 1.7% | |
| -152.4044 | 105 | 1.7% | |
| -122.0709 | 105 | 1.7% |
| Value | Count | Frequency (%) | |
| 145.6739 | 105 | 1.7% | |
| 144.7937 | 105 | 1.7% | |
| -64.8963 | 105 | 1.7% | |
| -66.5901 | 105 | 1.7% | |
| -69.3819 | 105 | 1.7% |
| Distinct count | 5097 |
|---|---|
| Unique (%) | 83.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 34447.20415848068 |
|---|---|
| Minimum | 0 |
| Maximum | 446452 |
| Zeros | 123 |
| Zeros (%) | 2.0% |
| Memory size | 47.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 49 |
| Q1 | 2400 |
| median | 11538 |
| Q3 | 37734.75 |
| 95-th percentile | 146184.55 |
| Maximum | 446452 |
| Range | 446452 |
| Interquartile range (IQR) | 35334.75 |
Descriptive statistics
| Standard deviation | 63444.33794 |
|---|---|
| Coefficient of variation (CV) | 1.841784827 |
| Kurtosis | 15.82596407 |
| Mean | 34447.20416 |
| Median Absolute Deviation (MAD) | 10917.5 |
| Skewness | 3.708679636 |
| Sum | 210403523 |
| Variance | 4025184017 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 0 | 123 | 2.0% | |
| 49 | 105 | 1.7% | |
| 103 | 105 | 1.7% | |
| 30 | 23 | 0.4% | |
| 69 | 22 | 0.4% | |
| 14 | 18 | 0.3% | |
| 22 | 12 | 0.2% | |
| 31 | 9 | 0.1% | |
| 66 | 8 | 0.1% | |
| 72 | 7 | 0.1% | |
| Other values (5087) | 5676 | 92.9% |
| Value | Count | Frequency (%) | |
| 0 | 123 | 2.0% | |
| 11 | 3 | < 0.1% | |
| 13 | 3 | < 0.1% | |
| 14 | 18 | 0.3% | |
| 15 | 3 | < 0.1% |
| Value | Count | Frequency (%) | |
| 446452 | 1 | < 0.1% | |
| 440185 | 1 | < 0.1% | |
| 430773 | 1 | < 0.1% | |
| 421286 | 1 | < 0.1% | |
| 414511 | 1 | < 0.1% |
| Distinct count | 2464 |
|---|---|
| Unique (%) | 40.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1707.6835297969876 |
|---|---|
| Minimum | 0 |
| Maximum | 32608 |
| Zeros | 240 |
| Zeros (%) | 3.9% |
| Memory size | 47.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 70 |
| median | 383 |
| Q3 | 1465.5 |
| 95-th percentile | 6885.2 |
| Maximum | 32608 |
| Range | 32608 |
| Interquartile range (IQR) | 1395.5 |
Descriptive statistics
| Standard deviation | 4154.764284 |
|---|---|
| Coefficient of variation (CV) | 2.432982582 |
| Kurtosis | 31.02185064 |
| Mean | 1707.68353 |
| Median Absolute Deviation (MAD) | 371 |
| Skewness | 5.18860849 |
| Sum | 10430531 |
| Variance | 17262066.26 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 0 | 240 | 3.9% | |
| 2 | 107 | 1.8% | |
| 5 | 106 | 1.7% | |
| 3 | 102 | 1.7% | |
| 17 | 90 | 1.5% | |
| 6 | 78 | 1.3% | |
| 56 | 50 | 0.8% | |
| 10 | 45 | 0.7% | |
| 16 | 44 | 0.7% | |
| 9 | 38 | 0.6% | |
| Other values (2454) | 5208 | 85.3% |
| Value | Count | Frequency (%) | |
| 0 | 240 | 3.9% | |
| 1 | 8 | 0.1% | |
| 2 | 107 | 1.8% | |
| 3 | 102 | 1.7% | |
| 4 | 15 | 0.2% |
| Value | Count | Frequency (%) | |
| 32608 | 1 | < 0.1% | |
| 32596 | 1 | < 0.1% | |
| 32594 | 1 | < 0.1% | |
| 32558 | 1 | < 0.1% | |
| 32520 | 1 | < 0.1% |
| Distinct count | 2869 |
|---|---|
| Unique (%) | 62.0% |
| Missing | 1477 |
| Missing (%) | 24.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 11816.05031310732 |
|---|---|
| Minimum | 0.0 |
| Maximum | 221510.0 |
| Zeros | 130 |
| Zeros (%) | 2.1% |
| Memory size | 47.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 29 |
| Q1 | 775 |
| median | 3317 |
| Q3 | 13356 |
| 95-th percentile | 55318 |
| Maximum | 221510 |
| Range | 221510 |
| Interquartile range (IQR) | 12581 |
Descriptive statistics
| Standard deviation | 20449.06161 |
|---|---|
| Coefficient of variation (CV) | 1.730617344 |
| Kurtosis | 16.69242177 |
| Mean | 11816.05031 |
| Median Absolute Deviation (MAD) | 2951 |
| Skewness | 3.3304013 |
| Sum | 54720129 |
| Variance | 418164120.7 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 0 | 130 | 2.1% | |
| 19 | 38 | 0.6% | |
| 12 | 21 | 0.3% | |
| 61 | 20 | 0.3% | |
| 15642 | 18 | 0.3% | |
| 64 | 16 | 0.3% | |
| 23887 | 16 | 0.3% | |
| 15 | 11 | 0.2% | |
| 51 | 11 | 0.2% | |
| 13 | 11 | 0.2% | |
| Other values (2859) | 4339 | 71.0% | |
| (Missing) | 1477 | 24.2% |
| Value | Count | Frequency (%) | |
| 0 | 130 | 2.1% | |
| 9 | 2 | < 0.1% | |
| 11 | 8 | 0.1% | |
| 12 | 21 | 0.3% | |
| 13 | 11 | 0.2% |
| Value | Count | Frequency (%) | |
| 221510 | 1 | < 0.1% | |
| 212216 | 1 | < 0.1% | |
| 203826 | 1 | < 0.1% | |
| 195315 | 1 | < 0.1% | |
| 186529 | 1 | < 0.1% |
| Distinct count | 4778 |
|---|---|
| Unique (%) | 78.4% |
| Missing | 17 |
| Missing (%) | 0.3% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 24064.14644557544 |
|---|---|
| Minimum | -120720.0 |
| Maximum | 438044.0 |
| Zeros | 94 |
| Zeros (%) | 1.5% |
| Memory size | 47.7 KiB |
Quantile statistics
| Minimum | -120720 |
|---|---|
| 5-th percentile | 20 |
| Q1 | 852.5 |
| median | 6450 |
| Q3 | 21021.5 |
| 95-th percentile | 124380 |
| Maximum | 438044 |
| Range | 558764 |
| Interquartile range (IQR) | 20169 |
Descriptive statistics
| Standard deviation | 51953.95978 |
|---|---|
| Coefficient of variation (CV) | 2.158977876 |
| Kurtosis | 17.37413245 |
| Mean | 24064.14645 |
| Median Absolute Deviation (MAD) | 6202 |
| Skewness | 3.850806333 |
| Sum | 146574716 |
| Variance | 2699213937 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 49 | 106 | 1.7% | |
| 100 | 96 | 1.6% | |
| 0 | 94 | 1.5% | |
| 2 | 34 | 0.6% | |
| 9 | 30 | 0.5% | |
| 12 | 21 | 0.3% | |
| 7 | 20 | 0.3% | |
| 23 | 18 | 0.3% | |
| 103 | 15 | 0.2% | |
| 11 | 14 | 0.2% | |
| Other values (4768) | 5643 | 92.4% | |
| (Missing) | 17 | 0.3% |
| Value | Count | Frequency (%) | |
| -120720 | 1 | < 0.1% | |
| -115936 | 1 | < 0.1% | |
| -111424 | 1 | < 0.1% | |
| -106988 | 1 | < 0.1% | |
| -100372 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 438044 | 1 | < 0.1% | |
| 431848 | 1 | < 0.1% | |
| 422572 | 1 | < 0.1% | |
| 413239 | 1 | < 0.1% | |
| 408734 | 1 | < 0.1% |
FIPS
Real number (ℝ≥0)
| Distinct count | 60 |
|---|---|
| Unique (%) | 1.0% |
| Missing | 19 |
| Missing (%) | 0.3% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3257.9022828050583 |
|---|---|
| Minimum | 1.0 |
| Maximum | 99999.0 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 47.7 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 4 |
| Q1 | 18 |
| median | 32 |
| Q3 | 48 |
| 95-th percentile | 78 |
| Maximum | 99999 |
| Range | 99998 |
| Interquartile range (IQR) | 30 |
Descriptive statistics
| Standard deviation | 17180.88557 |
|---|---|
| Coefficient of variation (CV) | 5.273603713 |
| Kurtosis | 24.74004635 |
| Mean | 3257.902283 |
| Median Absolute Deviation (MAD) | 15 |
| Skewness | 5.159946349 |
| Sum | 19837367 |
| Variance | 295182829.1 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 66 | 105 | 1.7% | |
| 20 | 105 | 1.7% | |
| 54 | 105 | 1.7% | |
| 48 | 105 | 1.7% | |
| 44 | 105 | 1.7% | |
| 72 | 105 | 1.7% | |
| 40 | 105 | 1.7% | |
| 36 | 105 | 1.7% | |
| 32 | 105 | 1.7% | |
| 30 | 105 | 1.7% | |
| Other values (50) | 5039 | 82.5% |
| Value | Count | Frequency (%) | |
| 1 | 105 | 1.7% | |
| 2 | 105 | 1.7% | |
| 4 | 105 | 1.7% | |
| 5 | 105 | 1.7% | |
| 6 | 105 | 1.7% |
| Value | Count | Frequency (%) | |
| 99999 | 104 | 1.7% | |
| 88888 | 104 | 1.7% | |
| 999 | 1 | < 0.1% | |
| 888 | 1 | < 0.1% | |
| 78 | 104 | 1.7% |
| Distinct count | 5398 |
|---|---|
| Unique (%) | 91.8% |
| Missing | 228 |
| Missing (%) | 3.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 505.3689272359527 |
|---|---|
| Minimum | 0.0 |
| Maximum | 2231.417438587298 |
| Zeros | 105 |
| Zeros (%) | 1.7% |
| Memory size | 47.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 48.85040923 |
| Q1 | 144.9333641 |
| median | 335.1103651 |
| Q3 | 716.7461476 |
| 95-th percentile | 1555.968115 |
| Maximum | 2231.417439 |
| Range | 2231.417439 |
| Interquartile range (IQR) | 571.8127836 |
Descriptive statistics
| Standard deviation | 478.1219857 |
|---|---|
| Coefficient of variation (CV) | 0.9460850477 |
| Kurtosis | 1.190478593 |
| Mean | 505.3689272 |
| Median Absolute Deviation (MAD) | 233.769541 |
| Skewness | 1.346515774 |
| Sum | 2971569.292 |
| Variance | 228600.6332 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 0 | 105 | 1.7% | |
| 54.40301755 | 23 | 0.4% | |
| 64.32486855 | 22 | 0.4% | |
| 25.38807486 | 18 | 0.3% | |
| 39.89554621 | 12 | 0.2% | |
| 56.21645147 | 9 | 0.1% | |
| 61.52813514 | 8 | 0.1% | |
| 67.12160197 | 7 | 0.1% | |
| 44.81753928 | 6 | 0.1% | |
| 93.77150199 | 6 | 0.1% | |
| Other values (5388) | 5664 | 92.7% | |
| (Missing) | 228 | 3.7% |
| Value | Count | Frequency (%) | |
| 0 | 105 | 1.7% | |
| 19.9477731 | 2 | < 0.1% | |
| 19.9477731 | 1 | < 0.1% | |
| 23.57464094 | 3 | < 0.1% | |
| 25.38807486 | 18 | 0.3% |
| Value | Count | Frequency (%) | |
| 2231.417439 | 2 | < 0.1% | |
| 2198.752885 | 1 | < 0.1% | |
| 2186.588608 | 1 | < 0.1% | |
| 2147.370203 | 1 | < 0.1% | |
| 2137.199454 | 1 | < 0.1% |
| Distinct count | 5479 |
|---|---|
| Unique (%) | 93.2% |
| Missing | 228 |
| Missing (%) | 3.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 373914.58554421767 |
|---|---|
| Minimum | 3.0 |
| Maximum | 7047355.0 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 47.7 KiB |
Quantile statistics
| Minimum | 3 |
|---|---|
| 5-th percentile | 4085.7 |
| Q1 | 51177.25 |
| median | 151820 |
| Q3 | 415379.25 |
| 95-th percentile | 1388184.55 |
| Maximum | 7047355 |
| Range | 7047352 |
| Interquartile range (IQR) | 364202 |
Descriptive statistics
| Standard deviation | 671985.5117 |
|---|---|
| Coefficient of variation (CV) | 1.797163143 |
| Kurtosis | 27.85742761 |
| Mean | 373914.5855 |
| Median Absolute Deviation (MAD) | 126593.5 |
| Skewness | 4.605159266 |
| Sum | 2198617763 |
| Variance | 4.515645279e+11 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 174 | 29 | 0.5% | |
| 3 | 19 | 0.3% | |
| 696 | 14 | 0.2% | |
| 8217 | 10 | 0.2% | |
| 1037 | 10 | 0.2% | |
| 8169 | 9 | 0.1% | |
| 65 | 9 | 0.1% | |
| 124 | 8 | 0.1% | |
| 105 | 8 | 0.1% | |
| 816 | 8 | 0.1% | |
| Other values (5469) | 5756 | 94.2% | |
| (Missing) | 228 | 3.7% |
| Value | Count | Frequency (%) | |
| 3 | 19 | 0.3% | |
| 38 | 2 | < 0.1% | |
| 40 | 3 | < 0.1% | |
| 55 | 1 | < 0.1% | |
| 56 | 2 | < 0.1% |
| Value | Count | Frequency (%) | |
| 7047355 | 1 | < 0.1% | |
| 6915876 | 1 | < 0.1% | |
| 6778304 | 1 | < 0.1% | |
| 6664419 | 1 | < 0.1% | |
| 6536932 | 1 | < 0.1% |
| Distinct count | 2462 |
|---|---|
| Unique (%) | 63.0% |
| Missing | 2200 |
| Missing (%) | 36.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5252.702405322416 |
|---|---|
| Minimum | 2.0 |
| Maximum | 89995.0 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 47.7 KiB |
Quantile statistics
| Minimum | 2 |
|---|---|
| 5-th percentile | 69 |
| Q1 | 465.75 |
| median | 1629 |
| Q3 | 4665 |
| 95-th percentile | 15097.7 |
| Maximum | 89995 |
| Range | 89993 |
| Interquartile range (IQR) | 4199.25 |
Descriptive statistics
| Standard deviation | 13150.54949 |
|---|---|
| Coefficient of variation (CV) | 2.503577869 |
| Kurtosis | 29.1253213 |
| Mean | 5252.702405 |
| Median Absolute Deviation (MAD) | 1416 |
| Skewness | 5.283137846 |
| Sum | 20527561 |
| Variance | 172936952 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 89995 | 53 | 0.9% | |
| 4 | 17 | 0.3% | |
| 83 | 15 | 0.2% | |
| 4389 | 13 | 0.2% | |
| 67 | 11 | 0.2% | |
| 291 | 11 | 0.2% | |
| 331 | 10 | 0.2% | |
| 65 | 10 | 0.2% | |
| 5285 | 10 | 0.2% | |
| 82 | 10 | 0.2% | |
| Other values (2452) | 3748 | 61.4% | |
| (Missing) | 2200 | 36.0% |
| Value | Count | Frequency (%) | |
| 2 | 4 | 0.1% | |
| 3 | 5 | 0.1% | |
| 4 | 17 | 0.3% | |
| 6 | 3 | < 0.1% | |
| 9 | 2 | < 0.1% |
| Value | Count | Frequency (%) | |
| 89995 | 53 | 0.9% | |
| 89861 | 1 | < 0.1% | |
| 89703 | 1 | < 0.1% | |
| 89590 | 1 | < 0.1% | |
| 89400 | 1 | < 0.1% |
| Distinct count | 5354 |
|---|---|
| Unique (%) | 89.5% |
| Missing | 123 |
| Missing (%) | 2.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.9999160473657813 |
|---|---|
| Minimum | 0.0 |
| Maximum | 70.37037037037038 |
| Zeros | 117 |
| Zeros (%) | 1.9% |
| Memory size | 47.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1.087620167 |
| Q1 | 2.309468822 |
| median | 3.733554581 |
| Q3 | 5.121699088 |
| 95-th percentile | 8.129773028 |
| Maximum | 70.37037037 |
| Range | 70.37037037 |
| Interquartile range (IQR) | 2.812230266 |
Descriptive statistics
| Standard deviation | 2.836388524 |
|---|---|
| Coefficient of variation (CV) | 0.709112014 |
| Kurtosis | 182.2756813 |
| Mean | 3.999916047 |
| Median Absolute Deviation (MAD) | 1.407546285 |
| Skewness | 8.883043544 |
| Sum | 23939.49754 |
| Variance | 8.04509986 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 0 | 117 | 1.9% | |
| 2.912621359 | 95 | 1.6% | |
| 6.666666667 | 24 | 0.4% | |
| 8.695652174 | 21 | 0.3% | |
| 14.28571429 | 18 | 0.3% | |
| 9.090909091 | 12 | 0.2% | |
| 6.060606061 | 11 | 0.2% | |
| 6.451612903 | 9 | 0.1% | |
| 3.448275862 | 7 | 0.1% | |
| 5.263157895 | 7 | 0.1% | |
| Other values (5344) | 5664 | 92.7% | |
| (Missing) | 123 | 2.0% |
| Value | Count | Frequency (%) | |
| 0 | 117 | 1.9% | |
| 0.3484320557 | 1 | < 0.1% | |
| 0.354609929 | 1 | < 0.1% | |
| 0.3636363636 | 1 | < 0.1% | |
| 0.4056283989 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 70.37037037 | 1 | < 0.1% | |
| 66.66666667 | 1 | < 0.1% | |
| 65.38461538 | 1 | < 0.1% | |
| 61.53846154 | 2 | < 0.1% | |
| 18.18181818 | 1 | < 0.1% |
UID
Real number (ℝ≥0)
| Distinct count | 59 |
|---|---|
| Unique (%) | 1.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 76783480.58251473 |
|---|---|
| Minimum | 16.0 |
| Maximum | 84099999.0 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 47.7 KiB |
Quantile statistics
| Minimum | 16 |
|---|---|
| 5-th percentile | 580 |
| Q1 | 84000012 |
| median | 84000028 |
| Q3 | 84000042 |
| 95-th percentile | 84000056 |
| Maximum | 84099999 |
| Range | 84099983 |
| Interquartile range (IQR) | 30 |
Descriptive statistics
| Standard deviation | 23547597.39 |
|---|---|
| Coefficient of variation (CV) | 0.3066753059 |
| Kurtosis | 6.73480465 |
| Mean | 76783480.58 |
| Median Absolute Deviation (MAD) | 16 |
| Skewness | -2.955096006 |
| Sum | 4.689934994e+11 |
| Variance | 5.54489343e+14 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 84000051 | 105 | 1.7% | |
| 84000018 | 105 | 1.7% | |
| 84000039 | 105 | 1.7% | |
| 84000024 | 105 | 1.7% | |
| 84000008 | 105 | 1.7% | |
| 84000048 | 105 | 1.7% | |
| 84000032 | 105 | 1.7% | |
| 84000017 | 105 | 1.7% | |
| 84000040 | 105 | 1.7% | |
| 84000025 | 105 | 1.7% | |
| Other values (49) | 5058 | 82.8% |
| Value | Count | Frequency (%) | |
| 16 | 105 | 1.7% | |
| 316 | 105 | 1.7% | |
| 580 | 105 | 1.7% | |
| 630 | 105 | 1.7% | |
| 850 | 105 | 1.7% |
| Value | Count | Frequency (%) | |
| 84099999 | 105 | 1.7% | |
| 84088888 | 105 | 1.7% | |
| 84070001 | 18 | 0.3% | |
| 84000056 | 105 | 1.7% | |
| 84000055 | 105 | 1.7% |
| Distinct count | 6 |
|---|---|
| Unique (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 47.7 KiB |
| USA | |
|---|---|
| PRI | 105 |
| GUM | 105 |
| MNP | 105 |
| ASM | 105 |
| Value | Count | Frequency (%) | |
| USA | 5583 | 91.4% | |
| PRI | 105 | 1.7% | |
| GUM | 105 | 1.7% | |
| MNP | 105 | 1.7% | |
| ASM | 105 | 1.7% | |
| VIR | 105 | 1.7% |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
| Distinct count | 5510 |
|---|---|
| Unique (%) | 93.7% |
| Missing | 228 |
| Missing (%) | 3.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6251.6253642461315 |
|---|---|
| Minimum | 5.3917075537822825 |
| Maximum | 28356.304534681338 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 47.7 KiB |
Quantile statistics
| Minimum | 5.391707554 |
|---|---|
| 5-th percentile | 739.5992692 |
| Q1 | 2286.517211 |
| median | 5021.443427 |
| Q3 | 9009.045127 |
| 95-th percentile | 15788.9655 |
| Maximum | 28356.30453 |
| Range | 28350.91283 |
| Interquartile range (IQR) | 6722.527915 |
Descriptive statistics
| Standard deviation | 4967.913382 |
|---|---|
| Coefficient of variation (CV) | 0.7946594833 |
| Kurtosis | 1.284749376 |
| Mean | 6251.625364 |
| Median Absolute Deviation (MAD) | 3109.553604 |
| Skewness | 1.169618172 |
| Sum | 36759557.14 |
| Variance | 24680163.37 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 312.7190381 | 29 | 0.5% | |
| 5.391707554 | 17 | 0.3% | |
| 1250.876152 | 13 | 0.2% | |
| 14900.98651 | 10 | 0.2% | |
| 1863.733578 | 10 | 0.2% | |
| 14813.94168 | 9 | 0.1% | |
| 117.8732047 | 9 | 0.1% | |
| 188.7097644 | 8 | 0.1% | |
| 222.8572456 | 8 | 0.1% | |
| 1466.544455 | 8 | 0.1% | |
| Other values (5500) | 5759 | 94.3% | |
| (Missing) | 228 | 3.7% |
| Value | Count | Frequency (%) | |
| 5.391707554 | 17 | 0.3% | |
| 5.391707554 | 2 | < 0.1% | |
| 67.08920137 | 1 | < 0.1% | |
| 68.9104889 | 1 | < 0.1% | |
| 68.9104889 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 28356.30453 | 1 | < 0.1% | |
| 28031.90508 | 1 | < 0.1% | |
| 27988.93735 | 1 | < 0.1% | |
| 27595.65717 | 1 | < 0.1% | |
| 27417.5888 | 1 | < 0.1% |
| Distinct count | 3789 |
|---|---|
| Unique (%) | 97.0% |
| Missing | 2200 |
| Missing (%) | 36.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 12.955935143257337 |
|---|---|
| Minimum | 1.4184397163120568 |
| Maximum | 38.501189532117365 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 47.7 KiB |
Quantile statistics
| Minimum | 1.418439716 |
|---|---|
| 5-th percentile | 6.103887652 |
| Q1 | 9.028964457 |
| median | 11.96966046 |
| Q3 | 16.29782086 |
| 95-th percentile | 22.63829228 |
| Maximum | 38.50118953 |
| Range | 37.08274982 |
| Interquartile range (IQR) | 7.268856399 |
Descriptive statistics
| Standard deviation | 5.335575767 |
|---|---|
| Coefficient of variation (CV) | 0.4118248284 |
| Kurtosis | 0.7444485609 |
| Mean | 12.95593514 |
| Median Absolute Deviation (MAD) | 3.570612648 |
| Skewness | 0.826037297 |
| Sum | 50631.79454 |
| Variance | 28.46836876 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 11.76470588 | 6 | 0.1% | |
| 13.56993737 | 6 | 0.1% | |
| 13.40535827 | 5 | 0.1% | |
| 10.52631579 | 5 | 0.1% | |
| 13.06376361 | 4 | 0.1% | |
| 4.411764706 | 4 | 0.1% | |
| 11.88276146 | 4 | 0.1% | |
| 10.81081081 | 4 | 0.1% | |
| 13.5371179 | 4 | 0.1% | |
| 22.28524948 | 3 | < 0.1% | |
| Other values (3779) | 3863 | 63.2% | |
| (Missing) | 2200 | 36.0% |
| Value | Count | Frequency (%) | |
| 1.418439716 | 3 | < 0.1% | |
| 1.438848921 | 1 | < 0.1% | |
| 2.127659574 | 1 | < 0.1% | |
| 2.205882353 | 3 | < 0.1% | |
| 2.302896301 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 38.50118953 | 1 | < 0.1% | |
| 37.2367935 | 1 | < 0.1% | |
| 35.77702703 | 1 | < 0.1% | |
| 34.72131148 | 1 | < 0.1% | |
| 33.14168378 | 1 | < 0.1% |
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.First rows
| Province_State | Country_Region | Last_Update | Lat | Long_ | Confirmed | Deaths | Recovered | Active | FIPS | Incident_Rate | People_Tested | People_Hospitalized | Mortality_Rate | UID | ISO3 | Testing_Rate | Hospitalization_Rate | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | Alabama | US | 2020-04-12 23:18:15 | 32.3182 | -86.9023 | 3563 | 93 | NaN | 3470.0 | 1.0 | 75.988020 | 21583.0 | 437.0 | 2.610160 | 84000001.0 | USA | 460.300152 | 12.264945 |
| 1 | Alaska | US | 2020-04-12 23:18:15 | 61.3707 | -152.4044 | 272 | 8 | 66.0 | 264.0 | 2.0 | 45.504049 | 8038.0 | 31.0 | 2.941176 | 84000002.0 | USA | 1344.711576 | 11.397059 |
| 2 | Arizona | US | 2020-04-12 23:18:15 | 33.7298 | -111.4312 | 3542 | 115 | NaN | 3427.0 | 4.0 | 48.662422 | 42109.0 | NaN | 3.246753 | 84000004.0 | USA | 578.522286 | NaN |
| 3 | Arkansas | US | 2020-04-12 23:18:15 | 34.9697 | -92.3731 | 1280 | 27 | 367.0 | 1253.0 | 5.0 | 49.439423 | 19722.0 | 130.0 | 2.109375 | 84000005.0 | USA | 761.753354 | 10.156250 |
| 4 | California | US | 2020-04-12 23:18:15 | 36.1162 | -119.6816 | 22795 | 640 | NaN | 22155.0 | 6.0 | 58.137726 | 190328.0 | 5234.0 | 2.812020 | 84000006.0 | USA | 485.423868 | 22.961176 |
| 5 | Colorado | US | 2020-04-12 23:18:15 | 39.0598 | -105.3111 | 7307 | 289 | NaN | 7018.0 | 8.0 | 128.943729 | 34873.0 | 1376.0 | 3.955112 | 84000008.0 | USA | 615.389991 | 18.831258 |
| 6 | Connecticut | US | 2020-04-12 23:18:15 | 41.5978 | -72.7554 | 12035 | 554 | NaN | 11481.0 | 9.0 | 337.560483 | 41220.0 | 1654.0 | 4.603241 | 84000009.0 | USA | 1156.148159 | 13.743249 |
| 7 | Delaware | US | 2020-04-12 23:18:15 | 39.3185 | -75.5071 | 1625 | 35 | 191.0 | 1590.0 | 10.0 | 166.878217 | 11103.0 | 190.0 | 2.153846 | 84000010.0 | USA | 1140.214672 | 11.692308 |
| 8 | Diamond Princess | US | 2020-04-12 23:18:15 | NaN | NaN | 49 | 0 | 0.0 | 49.0 | 888.0 | NaN | NaN | NaN | 0.000000 | 84088888.0 | USA | NaN | NaN |
| 9 | District of Columbia | US | 2020-04-12 23:18:15 | 38.8974 | -77.0268 | 1875 | 50 | 493.0 | 1825.0 | 11.0 | 265.675190 | 10640.0 | NaN | 2.666667 | 84000011.0 | USA | 1507.618148 | NaN |
Last rows
| Province_State | Country_Region | Last_Update | Lat | Long_ | Confirmed | Deaths | Recovered | Active | FIPS | Incident_Rate | People_Tested | People_Hospitalized | Mortality_Rate | UID | ISO3 | Testing_Rate | Hospitalization_Rate | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 6098 | Tennessee | US | 2020-07-26 04:35:13 | 35.7478 | -86.6923 | 90796 | 964 | 53808.0 | 36024.0 | 47.0 | 1329.531214 | 1332374.0 | 4196.0 | 1.061721 | 84000047.0 | USA | 19510.031521 | 4.621349 |
| 6099 | Texas | US | 2020-07-26 04:35:13 | 31.0545 | -97.5635 | 390286 | 4990 | 221510.0 | 163786.0 | 48.0 | 1346.004972 | 3306042.0 | NaN | 1.278550 | 84000048.0 | USA | 11401.764271 | NaN |
| 6100 | Utah | US | 2020-07-26 04:35:13 | 40.1500 | -111.8624 | 37623 | 274 | 24390.0 | 12959.0 | 49.0 | 1173.533777 | 500139.0 | 2213.0 | 0.728278 | 84000049.0 | USA | 15600.297945 | 5.882040 |
| 6101 | Vermont | US | 2020-07-26 04:35:13 | 44.0459 | -72.7107 | 1396 | 56 | 1182.0 | 158.0 | 50.0 | 223.721893 | 88816.0 | NaN | 4.011461 | 84000050.0 | USA | 14233.584246 | NaN |
| 6102 | Virgin Islands | US | 2020-07-26 04:35:13 | 18.3358 | -64.8963 | 352 | 7 | 236.0 | 109.0 | 78.0 | 328.150054 | 8253.0 | NaN | 1.988636 | 850.0 | VIR | 7693.813626 | NaN |
| 6103 | Virginia | US | 2020-07-26 04:35:13 | 37.7693 | -78.1700 | 83609 | 2075 | 10800.0 | 70734.0 | 51.0 | 979.542076 | 1010443.0 | 12001.0 | 2.481790 | 84000051.0 | USA | 11838.096781 | 14.353718 |
| 6104 | Washington | US | 2020-07-26 04:35:13 | 47.4009 | -121.4905 | 51849 | 1494 | NaN | 50355.0 | 53.0 | 680.889410 | 883982.0 | 5301.0 | 2.881444 | 84000053.0 | USA | 11608.593844 | 10.223919 |
| 6105 | West Virginia | US | 2020-07-26 04:35:13 | 38.4912 | -80.9545 | 5775 | 103 | 4115.0 | 1557.0 | 54.0 | 322.239191 | 256914.0 | NaN | 1.783550 | 84000054.0 | USA | 14335.542788 | NaN |
| 6106 | Wisconsin | US | 2020-07-26 04:35:13 | 44.2685 | -89.6165 | 47870 | 891 | 37287.0 | 9692.0 | 55.0 | 822.164751 | 860243.0 | 4368.0 | 1.861291 | 84000055.0 | USA | 14774.628618 | 9.124713 |
| 6107 | Wyoming | US | 2020-07-26 04:35:13 | 42.7560 | -107.3025 | 2446 | 25 | 1866.0 | 555.0 | 56.0 | 422.628417 | 48269.0 | 158.0 | 1.022077 | 84000056.0 | USA | 8340.086288 | 6.459526 |